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Abstract — In this paper we introduce deterministic mx n RIP 
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columns of these matrices are binary BCH code vectors that their 
zeros are replaced with 1 (excluding the normalization factor). 
The samples obtained by these matrices can be easily converted 
to the original sparse signal; more precisely, for the noiseless 
samples, the simple Matching Pursuit technique, even with less 
than the common computational complexity, exactly reconstructs 
the sparse signal. In addition, using Devore's binary matrices, we 
expand the binary scheme to matrices with {0, 1, — 1} elements. 

Index Terms — Compressed Sensing, Deterministic Matrices, 
Restricted Isometry Property , BCH codes. 



I. Introduction 

DECREASING the number of required samples for unique 
representation of a class of signals known as sparse has 
been the subject of extensive research in the past five years. 
The field of compressed sensing which was first introduced in 
HI and further in J2|, 0, deals with reconstruction of a n x 1 
but fc-sparse vector x nxl from its linear projections (y mx i) 
onto an m-dimensional {m -C ri) space: y mx i = $ mx Axi- 
The two main concerns in compressed sensing are 1) selecting 
the sampling matrix <& mX n an d 2) reconstruction of x nx i 
from the measurements y mx i by exploiting the sparsity con- 
straint. 

In general, the exact solution to the second problem, 
is shown to be an NP-complete problem J4); however, if 
the number of samples (to) exceeds the lower bound of 
to > 0(k log(n/fc)J, £i minimization (Basis Pursuit) can 
be performed instead of the exact £q minimization (sparsity 
constraint) with the same solution for almost all the possible 
inputs Q], Q. There are also other techniques such as greedy 
methods Q, that can be used. 

The first problem (sampling matrix) is usually treated by 
random selection of the matrix; among the well-known random 
matrices are i.i.d Gaussian HI and Rademacher [7] matrices. 
Before addressing some of the deterministic matrix construc- 
tions, we first describe the well known Restricted Isometry 
Property (RIP) 0: 

We say that the matrix A mxn obeys RIP of order k with 
constant < #j- < 1 (RIC) if for all fc-sparse vectors x nxl 
we have: 



1 - 5 k < 



lAxll 



< 1 



(1) 
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In other words, RIP of order k implies that each k columns 
of the matrix A resembles a quasi-orthonormal set: if B mX fc 
is formed by k different columns of A, all the eigenvalues 
of the Grammian matrix B T B should lie inside the interval 
[1 - 5 k , 1 + 5k\. 

RIP is a sufficient condition for stable recovery. The basis 
pursuit and greedy methods can be applied for recovery of 
fc-sparse vectors from noisy samples with good results if the 
matrix A obeys RIP of order 2k with a good enough constant 

5 2k ED. @. " 

In this paper we are interested in deterministic as opposed 
to random sampling matrices. Deterministic sampling matrices 
are useful because in practice, the sampler should finally 
choose a deterministic matrix; realizations of the random 
matrices are not guaranteed to work. Moreover, by proper 
choice of the matrix, complexity or compression rate may be 
improved. In deterministic sampling matrix, we are looking 
for to x n matrices which obey the RIP of order k. It is 
well-known that any k columns of a k x n Vandermond 
matrix are linearly independent; thus, if we normalize the 
columns, for all values of n, the new matrix satisfies the RIP 
condition of order fc. In other words, arbitrary RIP-constrained 
matrices could be constructed in this way; however, when 
n increases, the constant 5k rapidly approaches 1 and some 
of the k x k submatrices become ill-conditioned [9] which 
makes the matrix impractical. In ifTUl . p 2 x p r+1 matrices 
(p is a power of a prime integer) with 0, 1 elements (prior 
to normalization) are proposed which obey RIP of order k 
where kr < p. Another binary matrix construction with 
to = fc2 c, ' 1 ° slogTl ) measurements (E > 1) is investigated in 
ifTD which employs hash functions and extractor graphs. The 
connection between coding theory and compressed sensing 
matrices is established in [12| where second order Reed- 
Muller codes are used to construct 2x2 2 matrices with 
±1 elements; unfortunately, the matrix does not satisfy RIP 
for all k-sparse vectors. Complex to 2 x to matrices with chirp- 
type columns are also conjectured to obey RIP of some order 
lf]~3l - Recently, almost bound-achieving matrices have been 
proposed in lfl4l which, rather than the exact RIP, satisfy 

statistical RIP (high probability that RIP holds). In this paper, 

o( X log j) 

we explicitly introduce (2 l — 1) x 2 2 J matrices with 
±1 elements which obey the exact RIP for k < 2 J . The 
new construction is achieved by replacing the zeros of the 
linear binary block codes (specially BCH codes) by —1. In this 
approach, we require binary codes with minimum distances as 
large as almost half their code length; the existence of these 
codes will be shown by providing BCH codes. 

The rest of the paper is organized as follows: In the next 
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section we show the connection between linear block codes 
and construction of RIP-fulfilling ±1 matrices. In section 
Hill we introduce BCH codes that meet the requirements to 
produce compressed sensing matrices. Matrix construction 
and recovery of the sparse signal from the samples using 
the matching pursuit method is discussed in section [TV] The 
introduced matrices are combined with a previous scheme 
to form 0, ±1 matrices in section [V] and finally, section [VI] 
concludes the paper. 

II. Matrix Construction via Linear Codes 

In this section we will describe the connection between 
the sampling matrix and coding theory. Since the parameters 
k,n are used in both compressed sensing and coding theory, 
we distinguish the two by using the □ notation for coding 
parameters; i.e., h refers to the code length while n denotes 
the number of columns of the sampling matrix. 

Let C(h, k; 2) be a linear binary block code and lf ix i be 
the all 1 vector. We say C is 'symmetric' if l^ x i <= C- F° r 
symmetric codes, if a nxl is a code vector, due to the linearity 
of the code, complement of a„ x i which is defined as a„ x i © 
lnxi is also a valid code vector; therefore, code vectors consist 
of complement couples. 



Note that fixl , lf ix i G C and for each code vector a, either 
rf(0f ix i,a) or d(l Sxl ,a) cannot exceed ^; therefore, n — 
2d m in > 0. Combining Q and we have: 



(a,b)| < 



2d ri 



(5) 



which proves the claim on the inner product of the columns 
of A. Now let Bf lX fc be the matrix formed by k different 
columns of A. According to the previous arguments, B T B 
is a k x k matrix that has l's on its main diagonal while its 
off-diagonal elements have absolute values less than or equal 
It is now rather easy to complete the proof with 



h-2d„ 



to 

use of the Gershgorin circle theorem □ 

The above theorem is useful only when d m i n is close to 
§ (denominator for the upper bound of fc), which is not the 
case for the common binary codes. In fact, in communication 
systems, parity bits are inserted to protect the main data 
payload, i.e., k bits of data are followed by n — k parity 
bits. In this case, we have d m i n < h — k + 1; thus, to have 
d m in ~ § > the number of parity bits should have the same 
order as the data payload which is impractical. In the next 
section we show how these types of codes can be designed 
using the well-known BCH codes. 



Theorem 1: Let C(n,k;2) be a symmetric code with the 
minimum distance d m i n and let A~ 2 £_i be the matrix 
composed of code vectors as its columns such that from each 
complement couple, exactly one is selected. Define: 



1 



2A 



nx2 s -! V-)nx2 ,i - 1 ' ^ 



GF(2)[x] such that s (cc)|ir 2 

Then, A satisfies RIP with the constant 5k — (k — l)(l — in Galois theory, we know: 



III. BCH CODES WITH LARGE d min 

Since the focus in this section is on the design of BCH 
codes with large minimum distances, we first briefly review 
the BCH structure. 

BCH codes are a class of cyclic binary codes with h = 
2 m — 1 which are produced by a generating polynomial g(x) <E 

1-1 03J. According to a result 



for k < 



-2d„ 



1 (k is the RIP order). 



Proof. First note that the columns of A are normal. In fact 
2A~ v9 ;;_i — nfc-i is the same matrix as A where zeros 
are replaced by —1; hence, absolute value of each element of 
A is equal to which reveals that the columns are normal. 

Vn 

To prove the RIP, we use a similar approach to that of 1101 : 
we show that for each two columns of A, the absolute value 



of their inner product is less than 



-2d,, 



Let a,-j x i , b„ x i 



be two different columns of A and a fl x i , b fl x i be their 
corresponding columns in A. If a and b differ at I bits, we 
have: 



(a, b) = i ( 1 x (n - + (-1) x l) = 

n \ n 



(3) 



Moreover, b and a © l^ x i (complement of a) differ at h — I 
bits and since all the three vectors {a, a© lfj X i, b} are 
different code words (from each complement couple, exactly 
one is chosen and thus b =^ a© lf ix i), both I and n— I should 
be greater than or equal to d m i n , i.e.,: 



h — I > d„ 



dmin — ' — ^ d 7n in 

In - 2l\ <n— 2d min 



(4) 



1 = 



n 



(x 



(6) 



r e GF(T 



Hence, the BCH generating polynomial can be decomposed 
into the product of linear factors in GF(2 m )[x]. Let a E 
GF(2 m ) be a primitive root of the field and let a 1 be one of the 
roots of g(x). Since g(x) G GF(2)[x], all conjugate elements 
of a 1 (with respect to GF(2)) are also roots of g(x). Again 
using the results in Galois theory, we know that these conju- 
gates are different elements of the set {a* 2 }"^ 1 - Moreover, 
since a 2 _1 = 1, i\ = ^(mod 2™ — 1) implies a n = a 12 
which reveals the circular behavior of the exponents. 

The main advantage of the BCH codes compared to other 
cyclic codes is their guaranteed lower bound on the minimum 
distance fl5l : if a 11 , . . . , a %d are different roots of g(x) (not 
necessarily all the roots) such that ii, . . . , id form an arithmetic 
progression, then d m in > d+1. 

Now we get back to our code design approach. We construct 
the desired code generating polynomials by investigating their 
parity check polynomial which is defined as: 



h(x) ± 



9{x) 



(7) 
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In other words, each field element is the root of exactly one 
of the g(x) and h(x). We construct h(x) by introducing its 
roots. Let I < rh be an integer and define 



j(0 _ 



{oP, 



L +2 l -l 



} 



(8) 



Note that the definition of G^ depends on the choice of the 
primitive element (a). We further define Ti^j as the subset of 
q!~) which is closed with respect to the conjugate operation: 



(9) 



The above definition shows that if r G then its conjugate 
r 2 ' e H^. Now let us define h(x): 



h(x) = U (x-r) 



(10) 



As discussed before, if r is a root of h(x), all its conjugates 
are also roots of h(x); therefore, h(x) S GF(2)[x], which is 
a required condition. Moreover, 



1 



=*> (l + ac)|ft(a;) (11) 
which means that the all one vector is a valid code word: 
c=[l,...,lf 



c{x) = 1 + X + . . . - 



1 



'l + a 



= c(z)/i(x) (12) 



Hence, the code generated by g(x) 



i^y- is a symmetric 
code and fulfills the requirement of Theorem Q] For the 
minimum distance of the code, note that the roots of h(x 



form a subset of G^\ thus, all the elements in GF(2 m )\Q, 
are roots of g(x): 

V + 2 l < j < 2™ - 



g(a>) = 



(13) 



Consequently, there exists an arithmetic progression of length 
2 m ~ 1 — 2 l — 1 among the powers of a in roots of g(x). As a 
result: 



> (2 r: 



-2'-l) 



1 = 2" 1 " 1 - 2' 



(14) 
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In coding, it is usual to look for a code with maximum d 
given n, k. Here, we have designed a code with good d 
for a given n but with unknown k: 

h = k + deg(g(x)) 

^k = h — deg(g(x)) 

= (deg(g(x)) + deg(h{x))) - deg(g(x)) 

= deg(h{x))=\H { 2\ (15) 

The following theorem reveals how \H^, \ should be calcu- 
lated. 

Theorem 2: With the previous terminology, | is equal 
to the number of binary sequences of length rh such that if 



the sequence is written around a circle, between each two l's, 
there exists at least rh — I — 1 zeros. 

Proof. We show that there exists a 1-1 mapping be- 
tween the elements of 7t£ and the binary sequences. Let 
(&m-i, . . . , 6o) £ {0,1}™ be one of the binary sequences 
and let (3 be the decimal number that its binary representation 
coincides with the sequence: 



13 = (bfh-i 



■ bo) 



rh— 1 

E 

i=0 



b,2 l 



(16) 



We will show that a 13 S Ti^ . For the sake of simplicity, let us 
define (3j as the decimal number that its binary representation 
is the same as the sequence subjected to j units of left circular 
shift (/3b =/3): 



fh 

Pm-l 
Now we have: 



'rh-l 



JO) 2 



'rh-2 



, 6n6rf, 



0"m-l )2 



(bm-3 ■ ■ ■ bab m .-ibm-2)2 



(b b 7 % 



(17) 



2(3j — 2 x (bfh-x-j . . . bobfo-ibfh-jji 



— 2 m bm-l-j + (bm-2-j ■ ■ ■ fro&m-l&m-jO)2 

= f3 j+1 (mod 2* - 1) 
^ (3j = 2 3 (3 (mod 2™ - l) 
=> cP' = a v P (18) 

which shows that {a^ j }j are conjugates of ofi . To show oP £ 
we should prove that all the conjugates belong to G!$, 
or equivalently, we should show < 0j < 2 m ~ 1 + 2 l — 1. It 
is clear that < (3j\ to prove the right inequality we consider 
two cases: 

1) MSB of /3j is zero: 



C rh-l 



< 2 r 



2' - 1 (19) 



2) MSB of (3j is one; therefore, according to the property 
of the binary sequences, the following m — l — 1 bits are 



bm—l-i — 1 =^ 



3=0 

(3 j < 2™- 1 + 2 l - 1 (20) 



Up to now, we have proved that each binary sequence with 
the above zero-spacing property can be assigned to a separate 
root of h(x). To complete the proof, we show that if the binary 
representation of [3 does not satisfy the property, then we have 
ofi £ 7t~l . In fact, by circular shifts introduced in (3j, all the 
bits can be placed in the MSB position; thus, if the binary 
representation of f3 does not obey the property, at least one 
of the (3j's should be greater than 2 m_1 + 2 l — 1. This means 
that at least one of the conjugates of a 13 does not belong to 

a 

™ Tfl. 
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Theorem [2] relates the code parameter k to a combinatorics 
problem. Using this relation, it is shown in Appendix lAl that 

IV. Sampling and Reconstruction 

In previous sections, we presented the principles of matrix 
construction. In this section, in addition to a stepwise instruc- 
tion set, we focus on the column selection procedure from 
complement pairs. In the second part of this section, we show 
that the original sparse vector can be reconstructed from the 
samples by simple methods such as Matching Pursuit. 

A. Matrix Construction 

Recalling the arguments in the previous section, the choice 
of the polynomial g(x) depends on the choice of the primitive 
root. In addition to this degree of freedom, from Theorem[T] no 
matter which code vectors from complement sets are selected, 
the generated matrix satisfies RIP. Hence, for a given primitive 
element, there are 2 2 (there are 2 k ~ 1 complement pairs) 
possible matrix constructions. Among these huge number of 
possibilities, some of them have better characteristics for 
signal recovery from the samples. More specifically, we look 
for the matrices such that columns are closed with respect to 
the circular shift operation: if a — [ax,..., a^] T is a column of 
A, for all 1 < j < h, a.j = [dj, dj+x, ■ ■ ■ , a>n, o-Xi ■ ■ ■ , a j-i] T 
is also a column of A. 

The key point is that the BCH codes are a subset of cyclic 
codes, i.e., if c^ x i is a code vector, all its circular shifts are 
also valid code vectors. Thus, if we are careful in selecting 
from the complement sets, the generated sampling matrix will 
also have the cyclic property. For this selection, it should be 
noted that if ajj X i)bnxi is a complement pair and Cfi X i is 
a circular shifted version of a^ x i, the overal parity (sum of 
the elements in mod 2) of a^ x i and bs x i are different (each 
code vector has 2 m — 1 elements which is an odd number) 
while a^xi and c^xi have the same parity. Therefore, if we 
discard the code vectors with even (odd) parity (from the set 
of all code vectors), we are left with a set half the size of the 
main set such that from each complement set exactly one is 
selected while the set is still closed with respect to the circular 
shift operation. The selection algorithm is as follows: 

1) For a given k (compressed sensing parameter), let i = 
|~log 2 (fc)] and choose rh> i (the number of compressed 
samples will be m = 2 m — 1). 

2) Let Ttseq be the set of all binary sequences of length fa 
such that l's are circularly spaced with at least i zeros. 
In addition, let Hdec be the set of decimal numbers such 
that their binary representation is a sequence in Tt seq . 

3) Choose a as one of the primitive roots of GF(2 m ) and 
define: 

H = {a r \ re H dec } (21) 

4) Define the parity check and code generating polynomials 

as: 

h(x) = J] (x - r) (22) 

ren 



m 


h(x) 


4 


x b + a; 4 + x A + 1 


6 


X 1 + X a + + 1 


8 


X us + x n + x iu + x a + x s _|_ x i _|_ x 6 + 1 


10 


x Ab + x & + X M + X 1\S + x Lb + j.14 + x i6 + x ±a 
+ X 10 + X 9 + X 7 + X h + X 4 + X 3 + X + 1 



TABLE I 

Parity check polynomials for different values of m when i = 3. 




5 6 7 8 ~ 9 10 11 12 13 

m 



Fig. 1. Degree of h(x) for different values of m and i. 

and 



5) Let A( 2 m_i) x (2ii<sg(>o-i) be the binary matrix composed 
of even parity code vectors as its columns, i.e., if 
columns are considered as polynomial coefficients (in 
GF(2)[x}), each polynomial should be divisible by 
(x + l)g(x) (the additional factor of x + 1 implies the 
even parity). 

6) Replace all the zeros in A by —1 and normalize each 
column to obtain the final compressed sensing matrix 

(A( 2 ft_l) x (2<ies(f l )-l))- 

For a simple example, we consider the case m = i. It 
is easy to check that the number of l's in each of the 
binary sequences in step [2] cannot exceed one. Therefore, 
we have H d ec = {0, 2°, 2 1 , 2 2 , . . . , 2 2 '" 1 }. This means that 
h(x), except for the factor (x + 1) is the same as the 
minimal polynomial of a (the primitive root). Since for code 
generation, we use (x + l)g(x) instead of g(x), the effective 
h(x] will be the minimal polynomial of a which is a primitive 
polynomial. In this case, the matrix A is the (2 l — 1) x {2 l — 1) 
square matrix whose columns are circularly shifted versions 
of the Pseudo Noise Sequence (PNS) output generated by the 
primitive polynomial (the absolute value of the inner product 
of each two columns of A is exactly ^rUj- ). 

Table U summarizes some of the parity check polynomials 
for i = 3 (useful for k < 8). Also, Fig. Q] shows the degree of 
h(x) for some of the choices of fn and i. 

B. Reconstruction from the samples 

Matching Pursuit is one of the simplest methods for the 
recovery of sparse signals from sampling matrices (linear 
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projections). Here we show that this method can exactly 
recover the sparse signal from noiseless samples. 

Let A mx „ and s nx i be the sampling matrix and the k- 
sparse signal vector, respectively. The sampling process is 
defined by: 



Ymxl 



s nxl 



(24) 



For unique reconstruction of s„ x i from the samples y mx i, 
it is sufficient that the sampling matrix A mXn satisfies RIP 
of order 2k (8). In this section, we show that if A mx „ is 
constructed as described in previous section and satisfies RIP 
of order 2k, the matching pursuit method can be used for 
perfect reconstruction. In addition, due to the circular structure 
of the columns in A mx „, the computational complexity can 
be decreased (less than the ordinary matching pursuit). 

Let {ix, . . . , ik} C {1, . . . , n} be the nonzero locations in 
s nx i; thus, we have: 



Ymxi = A • s = s^.a^ 

3=1 



(25) 



where denotes the i th column in A. In the matching pursuit 
method, in order to find the nonzero locations in s, the inner 
products of the sample-vector (y) with all the columns of A 
are evaluated and then, the index of the maximum value (in 
absolute) is chosen as the most probable nonzero location. 
Here, we show that the index associated with the maximum 
value is always a nonzero location. Without loss of generality, 
assume |sjj > \si 2 \ > ... > \si k \. We then have: 



= | ^ s ij ( a ij > a n ) I 
3=1 

k 

— l s iil( a U! a u) — ^ l s ijll( a iji a ii) 
1 k 



3=2 



> 



k - 1 



k 



11 2k-V nl ~ 2k-V n] 
Now assume I £ {1, ... , n}\{ii, . . . , ik}: 
k 

\{y, a i)\ = \J2 s ij(^^ a i)\ 

3=1 

k 



(26) 



Combining 

l(y, a i)l < 



3=1 

< ^iEK-i 

3=1 

and (1271 , we get: 

k 



< 



k 



2k- r 



(27) 



(28) 



2k- 1' 

Hence, the largest inner product is obtained either with a ix 
or one of the other a, 's. Therefore, in the noiseless case, we 
never select a nonzero location by using the matching pursuit 



algorithm, and finally we reconstruct the original sparse signal 
perfectly. 

In each recursion of the matching pursuit algorithm, the 
inner product of y TOX i with all the columns in A mxra needs 
to be calculated. Each inner product requires m multiplications 
and m— 1 additions. Now we observe that the circular property 
of the columns of A can be useful. Let a be one of the 
columns in A and be its j th circularly shifted version. We 
observe that {aW}j are all columns of A; thus, (aW,y) has 
to be calculated for all j. Let {a^, a^ 2 ^ , . . . , a^} be different 
elements of {a")}j (obviously /i < to and more precisely 
fi\m). These inner products require \im multiplications and 
fi(m — 1) additions if directly calculated. 

An alternative approach for evaluation of these values 
is to employ Discrete Fourier Transform (DFT) or its fast 
implementation-FFT. The key point in this approach is that 
the inner products can be found through circular convolution 
of y and a, i.e., 



(y,a y) ) = yi 



(29) 



where © m represents the circular convolution with period m. 
It is well-known that the circular convolution can be easily 
calculated using DFT: if y/ and a/ denote the DFT of y and 
a, respectively, we have: 



IDFT{y f Qa f } = [y © m a| . . . , y © m a| 



(30) 



where v mx i u mxi = [viUi,. . . , v m u m ] T . For evaluation 
of the inner products in this way, y/has to be calculated 
only once using DFT. Thus, excluding the calculation of y/ 
(which is done only once), the inner products of y with 
{ay'}j require one DFT, one I DFT and m multiplications. 
Since /i different circular shifts of a are possible, at most 
fjL coefficients of a/ at equi-distance positions are nonzero; 
hence, /i-point DFT (and consequently IDFT) of a mx i rather 
than the general m-point DFT is adequate. For /i-point DFT 
of y, we can simply down-sample the evaluated to x 1 
vector of y/ (note that /Lt|m) and there is no need for an 
extra /i-point DFT. Employing the FFT version, we require 
2/i[log 2 fi\ multiplications and to — /i + 2/i[log 2 /i] additions 
per /i-point DFT or IDFT. Comparing the number of required 
multiplications in calculation of the above fi inner products 
reveal the efficiency of the DFT approach; i.e., the required 
computational complexity for reconstruction of the signal from 
the samples obtained from the sampling matrix is less than the 
common amount for general matrices. It should be emphasized 
that this reduction in computational complexity is the result 
of the circular format of the columns. 

V. Matrices with {0, 1, -1} Elements 

We have presented a method to generate RIP-fulfilling 
matrices with ±1 elements. In this section, we show that 
the matrices introduced in flOl can be improved using our 
technique in this paper. 

In iflOll . in contrast to this paper, binary compressed sensing 
matrices are considered. The main difficulty in designing 
such matrices is that the columns should (almost) be normal 
which means that prior to normalization, the number of l's in 
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each column is fixed (matrix elements are all scaled with the 



same coefficient for normalization). In iflOl , p 2 



x p 



r+l 



binary 



matrices are introduced such that in each column, exactly p 
elements are equal to 1 (equal to after normalization) and 
the inner product of each two columns is less than or equal to 
r (jj after normalization). Here p is a power of a prime integer; 
the matrix construction is based on polynomials in GF(p). 

It is evident that by changing some of the l's in the afore- 
mentioned matrix into — 1, the norm of the columns does not 
change; however, the inner products change. To show how we 
can benefit from this feature, let us assume that p = 2 l ; thus, 
there are 2 % nonzero elements in each column. We construct 
a new matrix from the original binary matrix as follows: we 
repeat each column 2 % times and then change the sign of the 
nonzero elements in the replicas in such a way that these 
nonzero elements form a Walch-Hadamard matrix. In other 
words, for each column, there are 2 % columns (including itself) 
that have the same pattern of nonzero elements. Moreover, the 
nonzero elements of these semi-replica vectors are different 
columns of the Walch-Hadamard matrix. Thus, the semi- 
replica vectors are orthogonal and the absolute value of the 
inner product of two vectors with different nonzero patterns is 
upper-bounded by r (maximum possible value in the original 
matrix). Hence, the new matrix still satisfies the RIP condition 
with the same k and 5k- 

Although we have expanded the matrix with this trick, 
the change is negligible when the order of matrix sizes are 
considered (p 2 x p r+1 is expanded to p 2 x p r+2 ). In fact, the 
orthogonality of the semi-replicas is not a necessary condition; 
we only need that their inner products do not exceed r in 
absolute value. It shows that instead of the Walch-Hadamard 
matrix, we can use other ±1 matrices with more number 
of columns (with the same number of rows) such that their 
columns are almost orthogonal (inner product less than r). 
This is the case for the matrices introduced in the previous 
sections. 

In order to mathematically describe the procedure, we need 
to define an operation. Let s be a j3 x 1 binary vector with ex- 
actly a elements of 1 in locations r%, . . . , r a E {1,2,..., /?}. 
Also, let x ax i = ■ ■ ■ , x a ] T be an arbitrary vector. We 
define ypxi = m( s j x ) as: 

V 1 < j < a : y T] = Xj 

Vj i {rx,...,r a } : y 3 =0 



(31) 



From the above definition, we can see: 

(m(s,x x ) , fJ,(s,x 2 )} = (xi,x 2 ) 



(32) 



Furthermore, if the elements of both xi , x 2 lie in the closed 
interval [—1,1], we have: 



|(/i(si,xi) , /i(s 2 ,x 2 ))| < (si,s 2 ) 



(33) 



For the matrix construction, let m be an integer such that 
p = 2 m — 1 is a prime (the primes of this form are called 
Mersenne primes). Let k < p be the required order of the RIP 
condition and let: 



Rog 2 k] 



Also let S p 2 xp r+i = [si ... s p r+i] be the binary 
RIP-fulfilling matrix constructed as in ifTUll and X px2 ,; = 

[xi ... x 2 fc] (k = \H < ^ L ~ lS> \ with the previous terminology) 
be the ±1 matrix introduced in the previous sections (we 
further normalize the columns of these matrices). We construct 
a new p 2 x (p r+1 .2 k ) matrix with elements in {0, 1,-1} by 
combining these two matrices: 

A=[(i{s i ,x j )] itj (35) 

Employing the same approach as used before, we show that A 
satisfies the RIP condition of order k, i.e., we show that the 
inner product of two different columns of A cannot exceed 
-r^r in absolute value while each column is normal: 

re— 1 

(36) 



( M s »> x i) > M(si,Xj) ) = (xj.Xj) = 1 



To study the inner product of fi(si 1 ,Xj 1 and ^(si 2 ,Xj 2 , we 



consider two cases: 

1) i\ = i2- In this case, since 

( M( S 'il' X jl) 1 A*( S <2) X j2j ) 



Sj 2 , we have: 



< 



1 



1 



(37) 



2) i\ 7^ %i and therefore, ^ Si 2 ; since the elements of 
both Xjj and xj 1 lie in [—1, 1], we have: 



( M( S ilJ X il) ! A*( S i2> X j2) ) 



< 



< 



k - 1 



(38) 



(34) 



Inequalities ([37]) and ([38]) hold due to the RIP-fulfilling 
structure of the matrices X and S. Hence, the claimed prop- 
erty of the inner products of the columns in A is proved. 
Consequently, A obeys the RIP condition of order k. 

VI. Conclusion 
Despite the enormous amount of literature in random sam- 
pling matrices for compressed sensing, deterministic designs 
are not well researched. In this paper, we introduce a new 
connection between the coding theory and RIP fulfilling 
matrices. In the new design, we replace the zeros in the 
binary linear code vectors by — 1 and use them as the columns 
of the sampling matrix in compressed sensing. The advan- 
tage of these matrices, in addition to their deterministic and 
known structure, is the simplicity in the sampling process; 
real/complex entries in the sampling matrix increases the com- 
putational complexity of the sampler as well as the required 
bit-precision for storing the samples. The linear codes for this 
purpose should have some desired characteristics; existence of 
such linear codes is proved by explicitly introducing binary 
BCH codes. One of the features of these matrices is that 
their produced samples can be easily (using matching pursuit 
method) decoded as the original sparse vector and due to the 
circular structure of the columns, the computational complex- 
ity in recovery can be reduced. These ±1 matrices are further 
expanded by considering {0, 1,-1} elements; this expansion 
is achieved by combining the ±1 matrices introduced in 
this paper with the Devore's binary matrices. Although the 
generated matrices show an improvement in the realizable 
size of the RIP-constrained matrices, the bound predicted by 
random matrices is not achieved yet. 
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Appendix A 
Evaluation of k 

In Theorem [2] we showed that k is equal to the number 
of binary sequences of length rh such that no two Is are 
spaced by less than m — I — 1 zeros (circular definition). To 
evaluate this number, let us define ri as the number of binary 
sequences of length 6 such that if the sequence is put around 
a circle, between each two l's, there is at least a zeros. In 
addition, let be the number of binary sequences such l's 
are spaced by at least a zeros apart (circular property is no 
longer valid for Kjf*). We first calculate «t and then we show 
the connection between «j and r b . 

There are two kinds of binary sequences counted in : 

1) The last bit in the sequence is 0; by omitting this bit, 
we obtain a sequence of length 6—1 with the same 
property. Also, each binary sequence of length 6 — 1 
with the above property can be padded by while still 
satisfying the required property to be included in 
Therefore, there are binary sequence of this type. 

2) The last bit in the sequence is 1; this means that the 
last a + 1 bits of the sequence are 0, . . . , 0, 1. Similar 

a 

to the above case, each binary sequence of length 6 — 
a — 1 counted in can be padded by the block 

0, . . . , 0, 1 to produce a sequence included in . Thus, 

a 

there are «;i^ _ 1 binary sequences of this type 



We thus have: 



(I + 6) -(1+5)^ = 1 
(l + <y)*((l + <5)^TT - 1 ) = 1 



(1 + <5)^t((1 + <5)-1)=]T(1 + <5); 

3=0 

a 

s(i + s)^ >y^i + ^!—s 

y ' ~ ^ a + 1 

3=0 

6(1 + 6) > |tf + a + l 

f . a-2.2 ^ (a-2) 2 1 /a + 4 

c-— ) > L 1 ^-+a+i> 



7 > 



a + 3\ °+ 



(45) 



Now we can show the connection between rj; and nf\ 
According to the definition of these parameters, we see that 
every binary sequence counted in r b is also counted in 
therefore: 

(46) 



_0) < (a) 
T b ^ K b 



In summary, we have the following recursive equation: In addition ' if a sequence counted in is padded with a 



zeros at the end, it satisfies the requirements to be counted in 



n b — n b-l ^ n b-a-l 

Since for b < a + 1, there can be at most one 1 in the binary 
sequence, we thus have: 



(39) >\ thus 



(a) (a) 
K b-a ^ T b 



1 < 6< a+ 1 



1 

» 



(40) 



From d39l , the last initial condition (n^li = a + 2) is 
equivalent to — 1. If we define the onesided Z-transform 
of as follows 



Combining the latter two inequalities, we get: 



0{j b - a ) < r b (a) < 0( 7 b ) 



(47) 



(48) 



r (a) 



(a). 



6=0 



it is not hard to check that: 

K (-)(z) = _L_ 



_a+l 



1 



„(a) 



1 



(41) 



(42) 



The above equation in conjunction with the result in (05]), 
yields: 



(49) 



Therefore, the increasing rate n^ 1 ' with respect to 6 (6 3> 1) 
has the same order as j b where 7 is the largest (in absolute 



value) root of f(z) = z 



— r a+l 



z a - 1. Since /(l) • /(2) < 0, 



there is a real root in (1 , 2); let us denote this root by 7. 
In fact, 7 is the largest root of f(z) (we do not prove this; 
however, if f(z) has a larger root, the increasing rate of Kjf^ 
would be greater than -f b ): 

1< 7 <2 , /( 7 )=7 Q+1 -7 a -l = (43) 

Since 7 > 1 we have 7 a+1 > 1. For the sake of simplicity, 
let us define: 



6± 



a+l 



1 



(44) 



The interpretation of the above inequality for k is as follows: 

k = r £- l -V >o((^ + l)^ (50) 

Figure |2] shows the asymptotic behavior of at different 
a values when b increases. 
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